An Experimental Study of Old and New Depth Measures

Authors

  • John Hugg
  • Eynat Rafalin
  • Kathryn Seyboth
  • Diane L. Souvaine
Abstract

Data depth is a statistical analysis method that assigns a numeric value to a point based on its centrality relative to a data set. Examples include the half-space depth (also known as Tukey depth), convex-hull peeling depth and L1 depth. Data depth has significant potential as a data analysis tool. However, the lack of efficient computational tools for depth-based analysis of large high-dimensional data sets has kept it from widespread use. We provide an experimental evaluation of several existing depth measures on different types of data sets, identify problems with the existing measures and suggest modifications. Specifically, we show that the L1 depth contours are not indicative of shape and suggest a PCA-based scaling that handles this problem; we demonstrate that most existing depth measures are unable to cope with multimodal data sets and that the newly suggested proximity graph depth addresses this issue; and we explore how depth measures perform when the underlying distribution is not elliptic. Our experimental tool is of independent interest: it is an interactive software tool for generating data sets and visualizing the performance of multiple depth measures. The tool uses a hierarchical render pipeline to allow for diverse data sets and fine control of the visual result. With this tool, new ideas in the field of data depth can be evaluated visually and quickly, allowing researchers to assess and adjust current depth functions.
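To make the notion of a depth value concrete, the following is a minimal sketch of the L1 depth of a point in the plane. It assumes the common formulation in which the depth is one minus the length of the average unit vector from the query point to the data points, with a correction for data points that coincide with the query point. The abstract does not give the authors' exact definition or implementation, so the function name `l1_depth`, the unit weights, and the 2D restriction are illustrative assumptions.

```python
import math


def l1_depth(x, points):
    """Sketch of L1 depth for a 2D query point x against a list of 2D points.

    Assumed formulation (not taken from the abstract):
        depth(x) = 1 - max(0, ||e(x)|| - f(x))
    where e(x) is the average of the unit vectors pointing from x to every
    data point different from x, and f(x) is the fraction of data points
    that coincide with x. Every data point has weight 1/n.
    """
    n = len(points)
    if n == 0:
        raise ValueError("points must not be empty")

    sum_x = 0.0
    sum_y = 0.0
    coincident = 0
    for px, py in points:
        dx = px - x[0]
        dy = py - x[1]
        dist = math.hypot(dx, dy)
        if dist == 0.0:
            # A data point equal to x has no direction; count it separately.
            coincident += 1
            continue
        sum_x += dx / dist
        sum_y += dy / dist

    mean_norm = math.hypot(sum_x, sum_y) / n
    return 1.0 - max(0.0, mean_norm - coincident / n)


# Hypothetical example data: the four corners of a square.
square = [(1.0, 1.0), (1.0, -1.0), (-1.0, 1.0), (-1.0, -1.0)]
center_depth = l1_depth((0.0, 0.0), square)    # central point, depth near 1
far_depth = l1_depth((100.0, 0.0), square)     # distant point, depth near 0
```

Under this formulation the depth depends only on the directions to the data points, which is consistent with the abstract's remark that L1 depth contours need not follow the shape of the data unless the data are rescaled first.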





Publication date: 2006